NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Label-free optical mapping for large-area biomechanical dynamics of multicellular systems

https://doi.org/10.1016/j.bios.2025.117281

Lin, Yen-Ju; Tan, Xing_Haw Marvin; Wang, Yijie; Chung, Pei-Shan; Zhang, Xiang; Wu, Ting-Hsiang; Wu, Tung-Yu; Deb, Arjun; Chiou, Pei-Yu (June 2025, Biosensors and Bioelectronics)

Free, publicly-accessible full text available June 1, 2026
Distributionally Robust Observable Strategic Queues

https://doi.org/10.1287/stsy.2022.0009

Wang, Yijie; Prasad, Madhushini Narayana; Hanasusanto, Grani A; Hasenbein, John J (September 2024, Stochastic Systems)

This paper presents an extension of Naor’s analysis on the join-or-balk problem in observable M/M/1 queues. Although all other Markovian assumptions still hold, we explore this problem assuming uncertain arrival rates under the distributionally robust settings. We first study the problem with the classical moment ambiguity set, where the support, mean, and mean-absolute deviation of the underlying distribution are known. Next, we extend the model to the data-driven setting, where decision makers only have access to a finite set of samples. We develop three optimal joining threshold strategies from the perspectives of an individual customer, a social optimizer, and a revenue maximizer such that their respective worst-case expected benefit rates are maximized. Finally, we compare our findings with Naor’s original results and the traditional sample average approximation scheme. Funding: This research was supported by the National Science Foundation [Grants 2342505 and 2343869].
more » « less
Full Text Available
Wasserstein Robust Classification with Fairness Constraints

https://doi.org/10.1287/msom.2022.0230

Wang, Yijie; Nguyen, Viet Anh; Hanasusanto, Grani A (July 2024, Manufacturing & Service Operations Management)

Problem definition: Data analytics models and machine learning algorithms are increasingly deployed to support consequential decision-making processes, from deciding which applicants will receive job offers and loans to university enrollments and medical interventions. However, recent studies show these models may unintentionally amplify human bias and yield significant unfavorable decisions to specific groups. Methodology/results: We propose a distributionally robust classification model with a fairness constraint that encourages the classifier to be fair in the equality of opportunity criterion. We use a type-[Formula: see text] Wasserstein ambiguity set centered at the empirical distribution to represent distributional uncertainty and derive a conservative reformulation for the worst-case equal opportunity unfairness measure. We show that the model is equivalent to a mixed binary conic optimization problem, which standard off-the-shelf solvers can solve. We propose a convex, hinge-loss-based model for large problem instances whose reformulation does not incur binary variables to improve scalability. Moreover, we also consider the distributionally robust learning problem with a generic ground transportation cost to hedge against the label and sensitive attribute uncertainties. We numerically examine the performance of our proposed models on five real-world data sets related to individual analysis. Compared with the state-of-the-art methods, our proposed approaches significantly improve fairness with negligible loss of predictive accuracy in the testing data set. Managerial implications: Our paper raises awareness that bias may arise when predictive models are used in service and operations. It generally comes from human bias, for example, imbalanced data collection or low sample sizes, and is further amplified by algorithms. Incorporating fairness constraints and the distributionally robust optimization (DRO) scheme is a powerful way to alleviate algorithmic biases. Funding: This work was supported by the National Science Foundation [Grants 2342505 and 2343869] and the Chinese University of Hong Kong [Grant 4055191]. Supplemental Material: The online appendices are available at https://doi.org/10.1287/msom.2022.0230 .
more » « less
Full Text Available
Synthesis of P-Containing Polycyclic Aromatic Hydrocarbons from Alkynyl-phosphonium Salts

https://doi.org/10.1021/acs.orglett.4c01579

Wang, Yijie; Su, Guangchen; Li, Mingsheng; Yao, Li; Chalifoux, Wesley A; Yang, Wenlong (June 2024, Organic Letters)

Full Text Available
Bias-aware Boolean Matrix Factorization Using Disentangled Representation Learning

Wang, Xiao; Wang, Jia; Zhao, Tong; Wang, Yijie; Zhang, Nan; Zang, Yong; Cao, Sha; Zhang, Chi (July 2024, Proceedings of Machine Learning Research)

Boolean matrix factorization (BMF) has been widely utilized in fields such as recommendation systems, graph learning, text mining, and -omics data analysis. Traditional BMF methods decompose a binary matrix into the Boolean product of two lower-rank Boolean matrices plus homoscedastic random errors. However, real-world binary data typically involves biases arising from heterogeneous row- and column-wise signal distributions. Such biases can lead to suboptimal fitting and unexplainable predictions if not accounted for. In this study, we reconceptualize the binary data generation as the Boolean sum of three components: a binary pattern matrix, a background bias matrix influenced by heterogeneous row or column distributions, and random flipping errors. We introduce a novel Disentangled Representation Learning for Binary matrices (DRLB) method, which employs a dual auto-encoder network to reveal the true patterns. DRLB can be seamlessly integrated with existing BMF techniques to facilitate bias-aware BMF. Our experiments with both synthetic and real-world datasets show that DRLB significantly enhances the precision of traditional BMF methods while offering high scalability. Moreover, the bias matrix detected by DRLB accurately reflects the inherent biases in synthetic data, and the patterns identified in the bias-corrected real-world data exhibit enhanced interpretability.
more » « less
Full Text Available
Bias-aware Boolean Matrix Factorization Using Disentangled Representation Learning

Wang, Xiao; Wang, Jia; Zhao, Tong; Wang, Yijie; Zhang, Nan; Zang, Yong; Cao, Sha; Zhang, Chi (April 2024, The 40th Conference on Uncertainty in Artificial Intelligence)

Boolean matrix factorization (BMF) has been widely utilized in fields such as recommendation systems, graph learning, text mining, and -omics data analysis. Traditional BMF methods decompose a binary matrix into the Boolean product of two lower-rank Boolean matrices plus homoscedastic random errors. However, real-world binary data typically involves biases arising from heterogeneous row- and column-wise signal distributions. Such biases can lead to suboptimal fitting and unexplainable predictions if not accounted for. In this study, we reconceptualize the binary data generation as the Boolean sum of three components: a binary pattern matrix, a background bias matrix influenced by heterogeneous row or column distributions, and random flipping errors. We introduce a novel Disentangled Representation Learning for Binary matrices (DRLB) method, which employs a dual auto-encoder network to reveal the true patterns. DRLB can be seamlessly integrated with existing BMF techniques to facilitate bias-aware BMF. Our experiments with both synthetic and real-world datasets show that DRLB significantly enhances the precision of traditional BMF methods while offering high scalability. Moreover, the bias matrix detected by DRLB accurately reflects the inherent biases in synthetic data, and the patterns identified in the bias-corrected real-world data exhibit enhanced interpretability.
more » « less
Full Text Available
Generalized Matrix Local Low Rank Representation by Random Projection and Submatrix Propagation

https://doi.org/10.1145/3580305.3599361

Dang, Pengtao; Zhu, Haiqi; Guo, Tingbo; Wan, Changlin; Zhao, Tong; Salama, Paul; Wang, Yijie; Cao, Sha; Zhang, Chi (August 2023, ACM)

Matrix low rank approximation is an effective method to reduce or eliminate the statistical redundancy of its components. Compared with the traditional global low rank methods such as singular value decomposition (SVD), local low rank approximation methods are more advantageous to uncover interpretable data structures when clear duality exists between the rows and columns of the matrix. Local low rank approximation is equivalent to low rank submatrix detection. Unfortunately,existing local low rank approximation methods can detect only submatrices of specific mean structure, which may miss a substantial amount of true and interesting patterns. In this work, we develop a novel matrix computational framework called RPSP (Random Probing based submatrix Propagation) that provides an effective solution for the general matrix local low rank representation problem. RPSP detects local low rank patterns that grow from small submatrices of low rank property, which are determined by a random projection approach. RPSP is supported by theories of random projection. Experiments on synthetic data demonstrate that RPSP outperforms all state-of-the-art methods, with the capacity to robustly and correctly identify the low rank matrices when the pattern has a similar mean as the background, background noise is heteroscedastic and multiple patterns present in the data. On real-world datasets, RPSP also demonstrates its effectiveness in identifying interpretable local low rank matrices.
more » « less
Full Text Available
Language brokering and immigrant-origin youth’s well-being: A meta-analytic review.

https://doi.org/10.1037/amp0001035

Shen, Yishan; Seo, Eunjin; Jiles, Alison I.; Zheng, Yao; Wang, Yijie (October 2022, American Psychologist)

Full Text Available
Pipeline for characterizing alternative mechanisms (PCAM) based on bi-clustering to study colorectal cancer heterogeneity

https://doi.org/10.1016/j.csbj.2023.03.028

Cao, Sha; Chang, Wennan; Wan, Changlin; Lu, Xiaoyu; Dang, Pengtao; Zhou, Xinyu; Zhu, Haiqi; Chen, Jian; Li, Bo; Zang, Yong; et al (January 2023, Computational and Structural Biotechnology Journal)

Full Text Available
FLUXestimator: a webserver for predicting metabolic flux and variations using transcriptomics data

https://doi.org/10.1093/nar/gkad444

Zhang, Zixuan; Zhu, Haiqi; Dang, Pengtao; Wang, Jia; Chang, Wennan; Wang, Xiao; Alghamdi, Norah; Lu, Alex; Zang, Yong; Wu, Wenzhuo; et al (May 2023, Nucleic Acids Research)

Abstract Quantitative assessment of single cell fluxome is critical for understanding the metabolic heterogeneity in diseases. Unfortunately, laboratory-based single cell fluxomics is currently impractical, and the current computational tools for flux estimation are not designed for single cell-level prediction. Given the well-established link between transcriptomic and metabolomic profiles, leveraging single cell transcriptomics data to predict single cell fluxome is not only feasible but also an urgent task. In this study, we present FLUXestimator, an online platform for predicting metabolic fluxome and variations using single cell or general transcriptomics data of large sample-size. The FLUXestimator webserver implements a recently developed unsupervised approach called single cell flux estimation analysis (scFEA), which uses a new neural network architecture to estimate reaction rates from transcriptomics data. To the best of our knowledge, FLUXestimator is the first web-based tool dedicated to predicting cell-/sample-wise metabolic flux and metabolite variations using transcriptomics data of human, mouse and 15 other common experimental organisms. The FLUXestimator webserver is available at http://scFLUX.org/, and stand-alone tools for local use are available at https://github.com/changwn/scFEA. Our tool provides a new avenue for studying metabolic heterogeneity in diseases and has the potential to facilitate the development of new therapeutic strategies.
more » « less

« Prev Next »

Search for: All records